STUDIA UNIVERSITATIS BABEŞ-BOLYAI: issue article summary
STUDIA INFORMATICA - Issue no. Sp. Issue 1 / 2009
Article: RECOVERING DIACRITICS USING WIKIPEDIA AND GOOGLE.
Authors: ADRIAN IFTENE, DIANA TRANDABĂŢ.
Abstract: The paper presents a method to restore diacritics using web contexts. The system receives one or more sentences in one language and uses the Google search engine to recover diacritics for the words of each sentence. Its accuracy is similar to that of existing systems, but its main advantage is that it uses resources and tools that are available for free or are easy to obtain for other languages, which leads us to believe that this approach could be valid for more languages.

Key words and phrases: Information Retrieval, Diacritics recovery, Wikipedia.
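
The general idea of frequency-based diacritics recovery can be illustrated with a minimal sketch. It is not the authors' system: the abstract does not describe the algorithm in detail, so everything below is an assumption made for illustration. The sketch assumes Romanian as the target language, works word by word rather than on sentence contexts, and replaces live web queries with a small invented table of counts (`TOY_COUNTS`). The names `VARIANTS`, `candidates`, `restore_word`, `restore_sentence`, and `toy_hit_count` are hypothetical.

```python
from itertools import product

# Assumed target language: Romanian. Each plain letter maps to the forms it
# may take once diacritics are restored (plain form listed first).
VARIANTS = {
    "a": ("a", "ă", "â"),
    "i": ("i", "î"),
    "s": ("s", "ș"),
    "t": ("t", "ț"),
}


def candidates(word):
    """Return every diacritic variant of a lowercase word, plain form first."""
    options = [VARIANTS.get(ch, (ch,)) for ch in word]
    return ["".join(combo) for combo in product(*options)]


def restore_word(word, hit_count):
    """Pick the candidate with the highest count.

    max() returns the first maximal element, so when all counts are equal
    the plain (unchanged) form is kept.
    """
    return max(candidates(word), key=hit_count)


def restore_sentence(sentence, hit_count):
    """Restore diacritics independently for each whitespace-separated word."""
    return " ".join(restore_word(w, hit_count) for w in sentence.split())


# Invented counts standing in for web hit counts (illustration only; a real
# system would obtain frequencies from a search engine or a corpus).
TOY_COUNTS = {"și": 900, "si": 40, "țară": 300, "tara": 25, "în": 800, "in": 60}


def toy_hit_count(form):
    """Look up a word form in the toy table; unseen forms count as zero."""
    return TOY_COUNTS.get(form, 0)


print(restore_sentence("tara si in", toy_hit_count))
```

Passing the counting function as a parameter keeps the candidate generation separate from the source of frequencies, so the toy table could be swapped for any other source of counts without changing the rest of the sketch.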